feat(config): dynamic configuration reload during batch execution (#402) by frankbria · Pull Request #450 · frankbria/codeframe

frankbria · 2026-03-17T22:19:06Z

Summary

Implements #402: Hot-reload CODEFRAME.md/AGENTS.md/CLAUDE.md during long-running batch executions without restarting.

ConfigFileWatcher — Polling daemon thread (2s interval) monitoring config file mtimes, follows StallMonitor pattern
ConfigReloadState — Thread-safe shared state with last-known-good fallback on parse failures
Conductor integration — Watcher started/stopped in _execute_serial and _execute_parallel, reload checked between task dispatches
Batch status display — cf work batch status shows "Config reloaded at HH:MM:SS" when reloads occurred
Event emission — CONFIG_RELOADED and CONFIG_RELOAD_FAILED event types

Acceptance Criteria

Config file changes detected within 5 seconds during batch execution (poll interval = 2s)
Valid config changes applied to subsequent task dispatches
Invalid config changes logged as error, previous config retained
In-flight agent tasks not interrupted by config reload (check between tasks only)
cf work batch status shows "config reloaded at HH:MM:SS" if reload occurred
Integration test: modify config during batch, verify next task uses new config

Test Plan

10 unit tests for ConfigReloadState and ConfigFileWatcher
3 CLI tests for batch status display with reload timestamps
5 integration tests for full reload lifecycle
63 existing conductor tests pass unchanged
All 98 related tests passing
Linting clean (ruff)

Implementation Notes

No new dependencies — uses stdlib threading + os.stat polling (matching existing patterns)
No schema migration — reload timestamps stored in existing results JSON column via __config_reloads__ key
max_parallel reload deferred — changing concurrency mid-batch is complex; agent preferences (prompt/tooling) reload is sufficient since each task subprocess loads fresh config at launch
Watcher is optional — wrapped in try/except so batch execution never fails due to watcher issues

Closes #402

Summary by CodeRabbit

New Features
- Dynamic configuration hot-reloads during batch execution: watched config files are monitored and applied without stopping batches.
- Batch status output now shows per-batch configuration reload timestamps and a "Config Reloads" section.
Tests
- Added integration and unit tests validating config watcher behavior, reloads, error handling, and CLI display.

Polling-based file watcher for hot-reloading CODEFRAME.md/AGENTS.md/ CLAUDE.md during batch execution. Thread-safe shared state with last-known-good fallback on parse failure. Adds CONFIG_RELOADED and CONFIG_RELOAD_FAILED event types.

Start/stop ConfigFileWatcher in _execute_serial and _execute_parallel. Check for config reloads between task dispatches. Store reload timestamps in batch results. Display in batch status output.

End-to-end tests: modify config during watcher execution, verify reload detection, multiple reloads, invalid config preservation, has_reloaded_since tracking, and clean thread shutdown.

coderabbitai · 2026-03-17T22:19:22Z

Walkthrough

Adds a polling-based config file watcher and in-memory reload state; integrates watcher into the conductor to detect and apply config reloads between task dispatches; records reload timestamps in batch results and surfaces them in the CLI batch status output.

Changes

Cohort / File(s)	Summary
Configuration Watcher `codeframe/core/config_watcher.py`	New module: `ConfigReloadState` (thread-safe prefs + reload history + errors) and `ConfigFileWatcher` (polling thread, mtime detection, reload/apply logic, diff summary, event emission).
Conductor Integration `codeframe/core/conductor.py`	Initialize/start/stop `ConfigFileWatcher` (best-effort), check/apply pending reloads between tasks/groups, record reload timestamps into `batch.results`, ensure watcher cleanup in finally blocks, and emit related events during task lifecycle.
Events `codeframe/core/events.py`	Added event types `CONFIG_RELOADED` and `CONFIG_RELOAD_FAILED`.
CLI Display `codeframe/cli/app.py`	Render per-batch config reload timestamps from `batch.results["__config_reloads__"]`, parsing ISO timestamps with fallback to raw values and showing a "Config Reloads" section.
Tests — CLI `tests/cli/test_config_reload_cli.py`	New tests verifying CLI shows "Config Reloads" header and timestamp lines when present, and omits the section when none exist.
Tests — Core (Watcher) `tests/core/test_config_watcher.py`	Comprehensive unit tests for `ConfigReloadState` and `ConfigFileWatcher` behaviors (start/stop, mtime detection, invalid config handling, thread-safety).
Tests — Integration `tests/core/test_config_reload_integration.py`	End-to-end integration tests exercising watcher startup, multiple reloads, invalid reloads preserving last-good config, reload checkpointing, and clean shutdown.

Sequence Diagram(s)

sequenceDiagram
    actor OP as Operator
    participant COND as Conductor
    participant CW as ConfigFileWatcher
    participant FS as Filesystem
    participant TASKS as TaskDispatcher
    participant EV as EventSystem

    OP->>COND: Start batch execution
    COND->>CW: start(initial_prefs)
    CW->>FS: capture initial mtimes
    CW->>CW: spawn polling thread

    loop for each task/group
        COND->>CW: poll / check for pending reloads
        CW->>FS: compare mtimes
        alt file changed
            CW->>CW: reload() -> parse prefs
            alt valid
                CW->>CW: apply_reload(new_prefs, timestamp)
                CW->>EV: emit CONFIG_RELOADED (diff)
                CW-->>COND: notify reload detected
            else invalid
                CW->>CW: record_error(msg)
                CW->>EV: emit CONFIG_RELOAD_FAILED
                CW-->>COND: notify error (non-fatal)
            end
        end
        COND->>TASKS: dispatch next task with current prefs
    end

    OP->>COND: Batch completes
    COND->>CW: stop()
    CW->>CW: signal thread exit / join
    COND->>OP: return batch results (includes reload timestamps)

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

feat(config): unified CODEFRAME.md workflow config (#398) #445: Related to CODEFRAME.md parsing and preference-loading changes that the watcher relies on.
feat(core): continuous reconciliation during batch execution #439: Overlapping changes to conductor background thread lifecycle and batch execution integration points.

Poem

🐰 I nibble files beneath moonlight bright,
Change a line, I spot it in the night,
Reloaded prefs, applied without a fright,
Tasks keep hopping forward, tidy and fleet—
Hot-swap configs make batch runs sweet! 🥕✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 54.17% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely describes the main feature: dynamic configuration reload capability during batch execution, matching the core changeset across all modified files.
Linked Issues check	✅ Passed	The PR comprehensively implements all coding-related requirements from `#402`: file watching (ConfigFileWatcher), thread-safe state (ConfigReloadState), conductor integration with serial/parallel execution, config reload events, CLI status display, and test coverage.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to the dynamic config reload feature: new watcher module, conductor integration, event types, CLI display, and comprehensive tests. No unrelated modifications detected.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feature/issue-402-dynamic-config-reload

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Tip

CodeRabbit can use Trivy to scan for security misconfigurations and secrets in Infrastructure as Code files.

Add a .trivyignore file to your project to customize which findings Trivy reports.

claude · 2026-03-17T22:21:45Z

Review: feat(config): dynamic configuration reload (Issue 402)

Good implementation overall - the architecture follows the existing StallMonitor pattern closely, the fallback-to-last-known-good is correct, and the test coverage is solid. A few issues worth addressing before merge.

BUGS

OSError not caught in newly-created file detection

_check_for_changes wraps the first mtime loop in try/except OSError, but the second loop (detecting newly-created files) does not:

for name in _CONFIG_FILES:
    path = self._workspace_path / name
    if path.is_file() and path not in self._watched_mtimes:
        self._watched_mtimes[path] = path.stat().st_mtime  # no try/except

If a file is created and deleted between is_file() and stat(), this raises an unhandled OSError in the daemon thread, silently killing the watcher. Wrap in the same try/except as the first loop.

Dead expression (pre-existing, now more prominent)

Both _execute_serial and _execute_parallel have:

completed_count + failed_count + blocked_count

This evaluates but the result is discarded. Not introduced by this PR but now sits inside the new try block, making it more visible.

DESIGN

Unnecessary _p suffix in _execute_parallel

Local variables config_watcher_p, reload_state_p, _last_seen_reload_p have a _p suffix that is not needed since they are in a separate function scope from the serial equivalents. The suffix adds visual noise without preventing any shadowing.

DB access from daemon thread in emit methods

_emit_reload_success and _emit_reload_failed call get_workspace(self._workspace_path), which hits SQLite from the watcher daemon thread. While wrapped in try/except, it introduces DB contention. Since the workspace path is known at construction time, consider passing the workspace in at start() or storing a reference rather than re-fetching on every reload event.

Unbounded reload_timestamps list

ConfigReloadState.reload_timestamps grows without bound. For multi-hour batches with frequent config changes or a flapping config file, this list could accumulate indefinitely. Consider capping at a reasonable limit (e.g. 100 most recent).

MINOR

Import inside loop in app.py

for ts in config_reloads:
try:
from datetime import datetime as _dt # repeated on each iteration

Python caches module imports so this is not a perf issue, but it reads oddly. Move the import above the loop.

What is working well:

The daemon thread lifecycle (start/stop/idempotent-stop) mirrors StallMonitor cleanly
try/except around watcher startup ensures batch execution is never broken by watcher failures
has_reloaded_since and the lock discipline in ConfigReloadState are correct
_build_diff_summary is a nice touch for observability
Integration tests use os.utime to force mtime advancement, avoiding filesystem timestamp resolution issues
The config_reloads key approach avoids a schema migration

The OSError gap in the newly-created file detection (point 1) is the only change I would call required before merge.

coderabbitai

Actionable comments posted: 7

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@codeframe/core/conductor.py`:
- Line 1641: The standalone expression completed_count + failed_count +
blocked_count is dead code; follow the same fix used in _execute_serial by
computing and using the total instead of discarding it. Replace the expression
with an assignment like finished_count = completed_count + failed_count +
blocked_count and then either return finished_count from the enclosing function
or use finished_count in the subsequent logic where the total is needed; do not
leave the sum as an unused expression.
- Line 1459: There is a dead expression computing completed_count + failed_count
+ blocked_count that isn’t assigned or used; remove that standalone expression
or, if the intent was to capture the total, replace it with an assignment like
total_count = completed_count + failed_count + blocked_count and use total_count
where needed (or return/log it). Locate the orphan expression referencing
completed_count, failed_count and blocked_count in the conductor code and either
delete it or convert it into a used variable/return value so the computed sum is
not discarded.
- Around line 1308-1332: The function _apply_pending_config_reload currently
only records reloads in batch.results and logs them; update it to emit audit
events by importing and using the event API (e.g., CONFIG_RELOADED and
CONFIG_RELOAD_FAILED plus the emitter function from core.events). Wrap the
has_reloaded_since/check-and-append logic in a try/except: on success emit
CONFIG_RELOADED with a payload containing identifying info (workspace id/name,
batch id, and the ISO timestamp 'now'), then save the batch and return now; on
exception emit CONFIG_RELOAD_FAILED with workspace/batch ids and the error
details, ensure the error is logged, and return last_seen_reload (or rethrow if
that matches project convention). Reference symbols:
_apply_pending_config_reload, reload_state.has_reloaded_since, CONFIG_RELOADED,
CONFIG_RELOAD_FAILED.

In `@codeframe/core/config_watcher.py`:
- Around line 122-127: The watcher currently only snapshots workspace-level
files and ignores inherited files/deletions, causing missed reloads; update
_snapshot_watched_files() to mirror load_preferences() by building the full
effective config set (repo-local files plus parent-directory traversals and the
user-level ~/.codeframe/AGENTS.md/CODEFRAME.md/CLAUDE.md used by
load_preferences()), populate self._watched_mtimes with entries for every path
in that set (use None for missing files) instead of only existing workspace
files, and change the change-detection logic that compares against
self._watched_mtimes to treat a transition between None and a real mtime (or
differing mtimes) as a trigger to call _reload(); ensure names referenced are
_snapshot_watched_files(), load_preferences(), _watched_mtimes, and _reload so
the watcher and loader remain in sync.
- Around line 122-127: The loop that populates self._watched_mtimes for names in
_CONFIG_FILES inside start() and the "new file" detection that assigns
self._watched_mtimes[path] = path.stat().st_mtime in the watch/poll loop must
guard against a race where the file is removed between is_file() and stat();
wrap the path.stat() call in a try/except FileNotFoundError (mirroring the
existing try/except used earlier around stat in the poll logic) and skip/ignore
that file when the exception occurs (optionally log at debug), ensuring the
daemon thread doesn't crash.
- Around line 50-74: The class exposes mutable internals: get_prefs returns a
live AgentPreferences and public fields reload_timestamps, last_error,
last_reload_at are mutable; change get_prefs to return a defensive copy (e.g.,
copy.deepcopy(self._prefs) or an immutable snapshot) and stop exposing mutable
lists directly by making reload_timestamps private and returning a copy (e.g.,
list(self._reload_timestamps) or tuple) from an accessor; ensure apply_reload
updates the private storage (_prefs, _reload_timestamps) and
record_error/has_reloaded_since continue to operate on private fields while
external readers only get immutable or copied snapshots to avoid cross-thread
mutation.

In `@tests/core/test_config_reload_integration.py`:
- Around line 21-25: The workspace fixture currently uses the real HOME so
global ~/.codeframe/AGENTS.md can bleed into tests; update the fixture to be
hermetic by accepting the pytest monkeypatch fixture (def workspace(tmp_path,
monkeypatch):), then set monkeypatch.setenv("HOME", str(tmp_path)) (and
optionally monkeypatch.setenv("XDG_CONFIG_HOME", str(tmp_path)) to be safe)
before calling create_or_load_workspace(repo_path) so load_preferences() will
only see the empty temp location instead of the developer/CI home; keep the rest
of the fixture (repo_path.mkdir() and return
create_or_load_workspace(repo_path)) the same.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 3a791b88-fce7-4851-8e8d-73d9a263d73b

📥 Commits

Reviewing files that changed from the base of the PR and between cd3e985 and cbc757c.

📒 Files selected for processing (7)

codeframe/cli/app.py
codeframe/core/conductor.py
codeframe/core/config_watcher.py
codeframe/core/events.py
tests/cli/test_config_reload_cli.py
tests/core/test_config_reload_integration.py
tests/core/test_config_watcher.py

coderabbitai · 2026-03-17T22:29:28Z

+def _apply_pending_config_reload(
+    batch: BatchRun,
+    workspace: Workspace,
+    reload_state: "ConfigReloadState",
+    last_seen_reload: datetime,
+) -> datetime:
+    """Check for config reloads and record them in batch results.
+
+    Args:
+        batch: Current batch run.
+        workspace: Target workspace.
+        reload_state: Shared reload state from ConfigFileWatcher.
+        last_seen_reload: Timestamp of the last check.
+
+    Returns:
+        Updated last_seen_reload timestamp.
+    """
+    if reload_state.has_reloaded_since(last_seen_reload):
+        now = datetime.now(timezone.utc)
+        reloads = batch.results.setdefault("__config_reloads__", [])
+        reloads.append(now.isoformat())
+        _save_batch(workspace, batch)
+        print(f"  [config] Configuration reloaded at {now.strftime('%H:%M:%S')}")
+        return now
+    return last_seen_reload


⚠️ Potential issue | 🟠 Major

Missing CONFIG_RELOADED event emission.

The PR objectives state that CONFIG_RELOADED and CONFIG_RELOAD_FAILED events should be emitted, but this function only appends to batch results and prints a log message without emitting any events.

As per coding guidelines: "Core modules in codeframe/core/ must emit events for state transitions via core/events.py for audit and observability."

🛠️ Proposed fix to emit CONFIG_RELOADED event

def _apply_pending_config_reload( batch: BatchRun, workspace: Workspace, reload_state: "ConfigReloadState", last_seen_reload: datetime, ) -> datetime: """Check for config reloads and record them in batch results. Args: batch: Current batch run. workspace: Target workspace. reload_state: Shared reload state from ConfigFileWatcher. last_seen_reload: Timestamp of the last check. Returns: Updated last_seen_reload timestamp. """ if reload_state.has_reloaded_since(last_seen_reload): now = datetime.now(timezone.utc) reloads = batch.results.setdefault("__config_reloads__", []) reloads.append(now.isoformat()) _save_batch(workspace, batch) print(f" [config] Configuration reloaded at {now.strftime('%H:%M:%S')}") + events.emit_for_workspace( + workspace, + events.EventType.CONFIG_RELOADED, + {"batch_id": batch.id, "timestamp": now.isoformat()}, + print_event=False, + ) return now return last_seen_reload

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def _apply_pending_config_reload(

batch: BatchRun,

workspace: Workspace,

reload_state: "ConfigReloadState",

last_seen_reload: datetime,

) -> datetime:

"""Check for config reloads and record them in batch results.

Args:

batch: Current batch run.

workspace: Target workspace.

reload_state: Shared reload state from ConfigFileWatcher.

last_seen_reload: Timestamp of the last check.

Returns:

Updated last_seen_reload timestamp.

"""

if reload_state.has_reloaded_since(last_seen_reload):

now = datetime.now(timezone.utc)

reloads = batch.results.setdefault("__config_reloads__", [])

reloads.append(now.isoformat())

_save_batch(workspace, batch)

print(f" [config] Configuration reloaded at {now.strftime('%H:%M:%S')}")

return now

return last_seen_reload

def _apply_pending_config_reload(

batch: BatchRun,

workspace: Workspace,

reload_state: "ConfigReloadState",

last_seen_reload: datetime,

) -> datetime:

"""Check for config reloads and record them in batch results.

Args:

batch: Current batch run.

workspace: Target workspace.

reload_state: Shared reload state from ConfigFileWatcher.

last_seen_reload: Timestamp of the last check.

Returns:

Updated last_seen_reload timestamp.

"""

if reload_state.has_reloaded_since(last_seen_reload):

now = datetime.now(timezone.utc)

reloads = batch.results.setdefault("__config_reloads__", [])

reloads.append(now.isoformat())

_save_batch(workspace, batch)

print(f" [config] Configuration reloaded at {now.strftime('%H:%M:%S')}")

events.emit_for_workspace(

workspace,

events.EventType.CONFIG_RELOADED,

{"batch_id": batch.id, "timestamp": now.isoformat()},

print_event=False,

)

return now

return last_seen_reload

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/conductor.py` around lines 1308 - 1332, The function _apply_pending_config_reload currently only records reloads in batch.results and logs them; update it to emit audit events by importing and using the event API (e.g., CONFIG_RELOADED and CONFIG_RELOAD_FAILED plus the emitter function from core.events). Wrap the has_reloaded_since/check-and-append logic in a try/except: on success emit CONFIG_RELOADED with a payload containing identifying info (workspace id/name, batch id, and the ISO timestamp 'now'), then save the batch and return now; on exception emit CONFIG_RELOAD_FAILED with workspace/batch ids and the error details, ensure the error is logged, and return last_seen_reload (or rethrow if that matches project convention). Reference symbols: _apply_pending_config_reload, reload_state.has_reloaded_since, CONFIG_RELOADED, CONFIG_RELOAD_FAILED.

coderabbitai · 2026-03-17T22:29:28Z

+
+        # Determine final batch status
+        total = len(batch.task_ids)
+        completed_count + failed_count + blocked_count


⚠️ Potential issue | 🟡 Minor

Dead code: expression computed but never used.

This line computes a sum but doesn't assign or use the result. Likely a leftover or debugging artifact.

🔧 Proposed fix

- completed_count + failed_count + blocked_count

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

completed_count + failed_count + blocked_count

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/conductor.py` at line 1459, There is a dead expression computing completed_count + failed_count + blocked_count that isn’t assigned or used; remove that standalone expression or, if the intent was to capture the total, replace it with an assignment like total_count = completed_count + failed_count + blocked_count and use total_count where needed (or return/log it). Locate the orphan expression referencing completed_count, failed_count and blocked_count in the conductor code and either delete it or convert it into a used variable/return value so the computed sum is not discarded.

coderabbitai · 2026-03-17T22:29:28Z

+
+        # Determine final batch status
+        total = len(batch.task_ids)
+        completed_count + failed_count + blocked_count


⚠️ Potential issue | 🟡 Minor

Dead code: same issue as in _execute_serial.

Expression computed but result is discarded.

🔧 Proposed fix

- completed_count + failed_count + blocked_count

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

completed_count + failed_count + blocked_count

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/conductor.py` at line 1641, The standalone expression completed_count + failed_count + blocked_count is dead code; follow the same fix used in _execute_serial by computing and using the total instead of discarding it. Replace the expression with an assignment like finished_count = completed_count + failed_count + blocked_count and then either return finished_count from the enclosing function or use finished_count in the subsequent logic where the total is needed; do not leave the sum as an unused expression.

coderabbitai · 2026-03-17T22:29:28Z

+    def __init__(self, initial_prefs: AgentPreferences) -> None:
+        self._prefs = initial_prefs
+        self._lock = threading.Lock()
+        self.last_reload_at: Optional[datetime] = None
+        self.last_error: Optional[str] = None
+        self.reload_timestamps: list[str] = []
+
+    def get_prefs(self) -> AgentPreferences:
+        with self._lock:
+            return self._prefs
+
+    def apply_reload(self, new_prefs: AgentPreferences, timestamp: datetime) -> None:
+        with self._lock:
+            self._prefs = new_prefs
+            self.last_reload_at = timestamp
+            self.last_error = None
+            self.reload_timestamps.append(timestamp.isoformat())
+
+    def record_error(self, msg: str) -> None:
+        with self._lock:
+            self.last_error = msg
+
+    def has_reloaded_since(self, since: datetime) -> bool:
+        with self._lock:
+            return self.last_reload_at is not None and self.last_reload_at > since


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash nl -ba codeframe/core/config_watcher.py | sed -n '50,74p' printf '\n' nl -ba codeframe/core/agents_config.py | sed -n '31,60p'

Repository: frankbria/codeframe

Length of output: 144

🏁 Script executed:

cat -n codeframe/core/config_watcher.py | sed -n '50,74p'

Repository: frankbria/codeframe

Length of output: 1192

🏁 Script executed:

fd -t f -n "agents_config.py" codeframe/core/

Repository: frankbria/codeframe

Length of output: 293

🏁 Script executed:

fd -t f -n "*.py" codeframe/core/ | grep -E "(config|preference)" | head -20

Repository: frankbria/codeframe

Length of output: 293

🏁 Script executed:

rg -A 10 "class AgentPreferences" codeframe/core/

Repository: frankbria/codeframe

Length of output: 1002

🏁 Script executed:

rg "get_prefs()" codeframe/ -A 3 -B 1

Repository: frankbria/codeframe

Length of output: 1072

🏁 Script executed:

rg "class AgentPreferences" codeframe/core/ -A 30

Repository: frankbria/codeframe

Length of output: 2406

🏁 Script executed:

rg "always_do|ask_first|never_do" codeframe/core/agents_config.py -B 2 -A 2

Repository: frankbria/codeframe

Length of output: 3273

🏁 Script executed:

rg "get_prefs()" codeframe/ -B 2 -A 5 | head -60

Repository: frankbria/codeframe

Length of output: 1637

🏁 Script executed:

rg "\.always_do|\.ask_first|\.never_do" codeframe/ -A 2

Repository: frankbria/codeframe

Length of output: 3238

ConfigReloadState leaks mutable state across threads without proper encapsulation.

get_prefs() returns a live reference to the internal AgentPreferences object, which contains mutable lists (always_do, ask_first, never_do, source_files) and dicts (tooling, code_style, commands). After the lock is released, callers can freely mutate these collections outside any synchronization. Additionally, public fields like reload_timestamps (a mutable list), last_error, and last_reload_at are directly accessible and modifiable. Thread-safety here relies only on caller convention, not enforcement. Return defensive copies via copy.deepcopy() or a frozen snapshot object to prevent accidental mutations.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/config_watcher.py` around lines 50 - 74, The class exposes mutable internals: get_prefs returns a live AgentPreferences and public fields reload_timestamps, last_error, last_reload_at are mutable; change get_prefs to return a defensive copy (e.g., copy.deepcopy(self._prefs) or an immutable snapshot) and stop exposing mutable lists directly by making reload_timestamps private and returning a copy (e.g., list(self._reload_timestamps) or tuple) from an accessor; ensure apply_reload updates the private storage (_prefs, _reload_timestamps) and record_error/has_reloaded_since continue to operate on private fields while external readers only get immutable or copied snapshots to avoid cross-thread mutation.

coderabbitai · 2026-03-17T22:29:28Z

+        # Snapshot current mtimes
+        self._watched_mtimes = {}
+        for name in _CONFIG_FILES:
+            path = self._workspace_path / name
+            if path.is_file():
+                self._watched_mtimes[path] = path.stat().st_mtime


⚠️ Potential issue | 🟠 Major

Watch the full effective config set, including deletions.

load_preferences() merges repo-local files, parent-directory files, and ~/.codeframe/AGENTS.md, but this watcher only snapshots workspace_path / {CODEFRAME, AGENTS, CLAUDE}.md and explicitly treats deleted files as a no-op. Editing or removing an inherited config therefore won't trigger _reload(), so later tasks can keep running with stale preferences until some repo-local file changes.

💡 Fix direction

- self._watched_mtimes = {} - for name in _CONFIG_FILES: - path = self._workspace_path / name - if path.is_file(): - self._watched_mtimes[path] = path.stat().st_mtime + self._watched_mtimes = self._snapshot_watched_files() ... - changed = False - ... - if changed: + current_mtimes = self._snapshot_watched_files() + changed = current_mtimes != self._watched_mtimes + self._watched_mtimes = current_mtimes + if changed: self._reload()

_snapshot_watched_files() should mirror codeframe/core/agents_config.py:load_preferences() so the watcher and loader stay in sync.

Also applies to: 153-180

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/config_watcher.py` around lines 122 - 127, The watcher currently only snapshots workspace-level files and ignores inherited files/deletions, causing missed reloads; update _snapshot_watched_files() to mirror load_preferences() by building the full effective config set (repo-local files plus parent-directory traversals and the user-level ~/.codeframe/AGENTS.md/CODEFRAME.md/CLAUDE.md used by load_preferences()), populate self._watched_mtimes with entries for every path in that set (use None for missing files) instead of only existing workspace files, and change the change-detection logic that compares against self._watched_mtimes to treat a transition between None and a real mtime (or differing mtimes) as a trigger to call _reload(); ensure names referenced are _snapshot_watched_files(), load_preferences(), _watched_mtimes, and _reload so the watcher and loader remain in sync.

⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash nl -ba codeframe/core/config_watcher.py | sed -n '122,127p;172,176p'

Repository: frankbria/codeframe

Length of output: 103

🏁 Script executed:

# First, let's check the file size and read the target sections wc -l codeframe/core/config_watcher.py

Repository: frankbria/codeframe

Length of output: 99

🏁 Script executed:

# Read lines 122-127 and surrounding context to understand the code cat -n codeframe/core/config_watcher.py | sed -n '115,135p'

Repository: frankbria/codeframe

Length of output: 854

🏁 Script executed:

# Read lines 173-176 and surrounding context cat -n codeframe/core/config_watcher.py | sed -n '165,185p'

Repository: frankbria/codeframe

Length of output: 943

🏁 Script executed:

# Search for _poll_loop method and its exception handling rg -n "_poll_loop|except|FileNotFoundError" codeframe/core/config_watcher.py -A 2

Repository: frankbria/codeframe

Length of output: 803

🏁 Script executed:

# Check the entire file structure to understand class and method organization cat -n codeframe/core/config_watcher.py | grep -E "^\s*(def|class)" | head -20

Repository: frankbria/codeframe

Length of output: 45

🏁 Script executed:

# Read the complete _poll_loop method to see exception handling scope cat -n codeframe/core/config_watcher.py | sed -n '144,180p'

Repository: frankbria/codeframe

Length of output: 1647

🏁 Script executed:

# Verify FileNotFoundError is OSError subclass and check if there's any outer exception handling python3 << 'EOF' # FileNotFoundError is a subclass of OSError in Python 3.3+ print("FileNotFoundError is OSError:", issubclass(FileNotFoundError, OSError)) EOF

Repository: frankbria/codeframe

Length of output: 97

🏁 Script executed:

# Double-check by looking for any try-except around _check_for_changes call cat -n codeframe/core/config_watcher.py | sed -n '144,148p'

Repository: frankbria/codeframe

Length of output: 311

Add exception handling for the file stat() race at lines 122-127 and 173-176.

Race condition: editors that save via atomic rename can delete the file between the is_file() check and the stat() call, raising FileNotFoundError. The first loop (lines 153–170) protects against this with a try-except at lines 162-165, but the "new file" detection loop (lines 173-176) does not. When stat() fails at line 176, the unhandled exception kills the daemon thread silently, stopping hot-reload for the rest of the polling cycle.

Wrap the stat() call at line 176 in a try-except block (similar to lines 162-165), and add the same protection at line 127 in the start() method.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@codeframe/core/config_watcher.py` around lines 122 - 127, The loop that populates self._watched_mtimes for names in _CONFIG_FILES inside start() and the "new file" detection that assigns self._watched_mtimes[path] = path.stat().st_mtime in the watch/poll loop must guard against a race where the file is removed between is_file() and stat(); wrap the path.stat() call in a try/except FileNotFoundError (mirroring the existing try/except used earlier around stat in the poll logic) and skip/ignore that file when the exception occurs (optionally log at debug), ensuring the daemon thread doesn't crash.

coderabbitai · 2026-03-17T22:29:28Z

+@pytest.fixture
+def workspace(tmp_path: Path):
+    repo_path = tmp_path / "test_repo"
+    repo_path.mkdir()
+    return create_or_load_workspace(repo_path)


⚠️ Potential issue | 🟠 Major

Make this fixture hermetic to global CodeFRAME defaults.

load_preferences() also merges ~/.codeframe/AGENTS.md, but this fixture uses the runner's real home directory. Any developer/CI machine with global defaults can change the merged preferences and make these assertions nondeterministic. Patch the home directory (or the global-config lookup) to an empty temp location before the tests run. The same setup pattern appears in tests/core/test_config_watcher.py.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@tests/core/test_config_reload_integration.py` around lines 21 - 25, The workspace fixture currently uses the real HOME so global ~/.codeframe/AGENTS.md can bleed into tests; update the fixture to be hermetic by accepting the pytest monkeypatch fixture (def workspace(tmp_path, monkeypatch):), then set monkeypatch.setenv("HOME", str(tmp_path)) (and optionally monkeypatch.setenv("XDG_CONFIG_HOME", str(tmp_path)) to be safe) before calling create_or_load_workspace(repo_path) so load_preferences() will only see the empty temp location instead of the developer/CI home; keep the rest of the fixture (repo_path.mkdir() and return create_or_load_workspace(repo_path)) the same.

…orkspace (#402) - Wrap newly-created file detection in try/except OSError - Cap reload_timestamps at 100 entries - Cache workspace reference to avoid DB access on every emit - Move import outside loop in batch_status

coderabbitai

♻️ Duplicate comments (3)

codeframe/core/config_watcher.py (3)

127-131: ⚠️ Potential issue | 🟠 Major

Guard startup mtime snapshot against TOCTOU race.

At Line [130], path.stat() can raise after path.is_file() if the file is atomically replaced/deleted during startup, which can fail watcher initialization.

Suggested fix

         for name in _CONFIG_FILES:
             path = self._workspace_path / name
             if path.is_file():
-                self._watched_mtimes[path] = path.stat().st_mtime
+                try:
+                    self._watched_mtimes[path] = path.stat().st_mtime
+                except OSError:
+                    continue

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@codeframe/core/config_watcher.py` around lines 127 - 131, The startup mtime
snapshot loop using _CONFIG_FILES currently calls path.is_file() then
path.stat(), which can raise if the file is removed/replace between those calls;
update the loop in config_watcher.py (the block that populates
self._watched_mtimes) to guard the stat with a try/except (catch
FileNotFoundError/OSError) and skip the path when stat fails, so you only add
entries when path.stat() succeeds; reference _CONFIG_FILES, path.is_file(),
path.stat().st_mtime and self._watched_mtimes when making the change.

53-56: ⚠️ Potential issue | 🟠 Major

ConfigReloadState still leaks mutable cross-thread state.

Line [57] returns a live AgentPreferences object, and Lines [53]-[56] expose mutable/public state. External callers can mutate shared state outside the lock.

Suggested hardening

+import copy
 ...
 class ConfigReloadState:
     def __init__(self, initial_prefs: AgentPreferences) -> None:
         self._prefs = initial_prefs
         self._lock = threading.Lock()
-        self.last_reload_at: Optional[datetime] = None
-        self.last_error: Optional[str] = None
-        self.reload_timestamps: list[str] = []
+        self._last_reload_at: Optional[datetime] = None
+        self._last_error: Optional[str] = None
+        self._reload_timestamps: list[str] = []

     def get_prefs(self) -> AgentPreferences:
         with self._lock:
-            return self._prefs
+            return copy.deepcopy(self._prefs)
+
+    def get_last_reload_at(self) -> Optional[datetime]:
+        with self._lock:
+            return self._last_reload_at
+
+    def get_last_error(self) -> Optional[str]:
+        with self._lock:
+            return self._last_error
+
+    def get_reload_timestamps(self) -> tuple[str, ...]:
+        with self._lock:
+            return tuple(self._reload_timestamps)

Also applies to: 57-59

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@codeframe/core/config_watcher.py` around lines 53 - 56, ConfigReloadState
currently exposes mutable cross-thread state (attributes last_reload_at,
last_error, reload_timestamps and the AgentPreferences object returned live), so
callers can mutate shared state outside the lock; fix by making internal state
private and only returning immutable copies under the lock: e.g., rename fields
to _last_reload_at, _last_error, _reload_timestamps and store reload_timestamps
as a tuple (or always return a copy), and change the accessor that returns
AgentPreferences (look for the method that returns AgentPreferences) to return a
deep/shallow copy or an immutable/frozen representation while holding the same
lock that protects state. Ensure all external accessors read state under the
lock and never expose the live mutable objects directly.

158-163: ⚠️ Potential issue | 🟠 Major

Deleted config files never trigger reload.

Lines [162]-[163] skip deletions without marking a change, so removing a watched config file does not call _reload(), leaving stale settings active.

Suggested fix

         for name in _CONFIG_FILES:
             path = self._workspace_path / name
             if not path.is_file():
                 # File might have been created since start
                 if path not in self._watched_mtimes:
                     continue
-                # File was deleted — skip
-                continue
+                # File was deleted — trigger reload and drop stale mtime
+                del self._watched_mtimes[path]
+                changed = True
+                continue

Also applies to: 185-186

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@codeframe/core/config_watcher.py` around lines 158 - 163, The watcher
currently skips deleted files without marking them changed so removals never
trigger _reload(); update the branch that handles "if not path.is_file()" to
detect deletions by checking if path is present in self._watched_mtimes, and
when it is, remove it from self._watched_mtimes (or mark it as changed) and call
the reload path (e.g., invoke self._reload() or enqueue the change) so deleted
configs trigger a reload; make the same change for the analogous block that
checks path existence around the lines referencing self._watched_mtimes (the
second occurrence noted) to ensure deletions are handled consistently.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@codeframe/core/config_watcher.py`:
- Around line 127-131: The startup mtime snapshot loop using _CONFIG_FILES
currently calls path.is_file() then path.stat(), which can raise if the file is
removed/replace between those calls; update the loop in config_watcher.py (the
block that populates self._watched_mtimes) to guard the stat with a try/except
(catch FileNotFoundError/OSError) and skip the path when stat fails, so you only
add entries when path.stat() succeeds; reference _CONFIG_FILES, path.is_file(),
path.stat().st_mtime and self._watched_mtimes when making the change.
- Around line 53-56: ConfigReloadState currently exposes mutable cross-thread
state (attributes last_reload_at, last_error, reload_timestamps and the
AgentPreferences object returned live), so callers can mutate shared state
outside the lock; fix by making internal state private and only returning
immutable copies under the lock: e.g., rename fields to _last_reload_at,
_last_error, _reload_timestamps and store reload_timestamps as a tuple (or
always return a copy), and change the accessor that returns AgentPreferences
(look for the method that returns AgentPreferences) to return a deep/shallow
copy or an immutable/frozen representation while holding the same lock that
protects state. Ensure all external accessors read state under the lock and
never expose the live mutable objects directly.
- Around line 158-163: The watcher currently skips deleted files without marking
them changed so removals never trigger _reload(); update the branch that handles
"if not path.is_file()" to detect deletions by checking if path is present in
self._watched_mtimes, and when it is, remove it from self._watched_mtimes (or
mark it as changed) and call the reload path (e.g., invoke self._reload() or
enqueue the change) so deleted configs trigger a reload; make the same change
for the analogous block that checks path existence around the lines referencing
self._watched_mtimes (the second occurrence noted) to ensure deletions are
handled consistently.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 90297da6-5c9c-4f22-aeed-f71694b448e3

📥 Commits

Reviewing files that changed from the base of the PR and between cbc757c and c501ceb.

📒 Files selected for processing (2)

codeframe/cli/app.py
codeframe/core/config_watcher.py

claude · 2026-03-17T22:34:46Z

Follow-up to my earlier review — checking what the latest commit (c501ceb) addressed.

Resolved from prior review

reload_timestamps unbounded growth — cap at 100 entries is now in place. Good.
config_watcher.py changes in the new commit appear to have tightened the newly-created file detection path.

Still open

OSError race in newly-created file detection — CodeRabbit's latest pass (cbc757c → c501ceb) confirms the path.stat() call in the new-file detection loop still has no try/except FileNotFoundError. The fix is one line: wrap self._watched_mtimes[path] = path.stat().st_mtime in a try/except (OSError, FileNotFoundError): continue. This remains the only change I'd call required before merge.
Dead expression — completed_count + failed_count + blocked_count in both _execute_serial and _execute_parallel is still a silent no-op. Not a regression from this PR, but it's now prominent inside the new try block. Worth a quick fix.
_p suffix naming in _execute_parallel (config_watcher_p, reload_state_p, etc.) — still present. These are local to their function scope so there is no collision risk; the suffix is just noise.
datetime import inside the for-loop in app.py — CodeRabbit flagged this again in the latest pass. Python caches it so it is not a perf problem, but it reads oddly. A top-of-function import cleans it up.

Summary

The architecture is solid and the test coverage across unit/integration/CLI layers is thorough. One required fix remains (OSError gap in point 1). Everything else is polish.

Test User added 3 commits March 17, 2026 08:24

feat(config): integrate config watcher into batch execution (#402)

08f4eb1

Start/stop ConfigFileWatcher in _execute_serial and _execute_parallel. Check for config reloads between task dispatches. Store reload timestamps in batch results. Display in batch status output.

test(config): add integration tests for config reload lifecycle (#402)

cbc757c

End-to-end tests: modify config during watcher execution, verify reload detection, multiple reloads, invalid config preservation, has_reloaded_since tracking, and clean thread shutdown.

coderabbitai Bot reviewed Mar 17, 2026

View reviewed changes

frankbria merged commit 4308b0a into main Mar 17, 2026
11 checks passed

This was referenced Mar 19, 2026

README has stale [planned] markers for shipped features #452

Closed

No CI pipeline: PRs merge without automated test gate #453

Closed

frankbria deleted the feature/issue-402-dynamic-config-reload branch March 24, 2026 23:28

coderabbitai Bot mentioned this pull request Apr 3, 2026

[Phase 4.B] Worktree isolation for parallel batch execution #533

Closed

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(config): dynamic configuration reload during batch execution (#402)#450

feat(config): dynamic configuration reload during batch execution (#402)#450
frankbria merged 4 commits into
mainfrom
feature/issue-402-dynamic-config-reload

frankbria commented Mar 17, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Mar 17, 2026 •

edited

Loading

❌ Failed checks (1 warning)

Uh oh!

claude Bot commented Mar 17, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot Mar 17, 2026

Uh oh!

coderabbitai Bot left a comment

Uh oh!

claude Bot commented Mar 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

frankbria commented Mar 17, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Acceptance Criteria

Test Plan

Implementation Notes

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

claude Bot commented Mar 17, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

claude Bot commented Mar 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

frankbria commented Mar 17, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 17, 2026 •

edited

Loading